SEQUENCE LISTING 



<110> Nielsen, Bjarne Roenfeldt 
Svendsen, Allan 
Pedersen, Henrik 
Vind, Jesper 
Hendriksen, Hanne Vang 
Frandsen, Torben Peter 

<120> Glucoamylase Variants 



<130> 5636.200-US 

<140> 09/351,814 
<141> 1999-07-12 

<150> PA 1998 00937 
<151> 1998-07-13 

<150> PA 1998 01667 
<151> 1998-12-17 

<150> 60/093,528 
<151> 1998-07-21 

<150> 60/115,545 
<151> 1999-01-12 

<160> 81 

<170> FastSEQ for Windows Version 3.0 . 

<210> 1 
<211> 1605 
<212> DNA 

<213> Aspergillus niger 
<220> 

<221> sig_peptide 
<222> (1) . . . (72) 

<221> mat_peptide 
<222> (73) . . . (1602) 

<221> CDS 

<222> (1) . . . (1602) 

<400> 1 

atg teg ttc cga tct eta etc gcc ctg age ggc etc gtc tgc aca ggg 48 

Met Ser Phe Arg Ser Leu Leu Ala Leu Ser Gly Leu Val Cys Thr Gly 
-20 -15 -10 

ttg gca aat gtg att tec aag cge gcg acc ttg gat tea tgg ttg age 96 
Leu Ala Asn Val lie Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu Ser 
-5 15 

aac gaa gcg acc gtg get cgt act gcc ate ctg aat aac ate ggg gcg 144 
Asn Glu Ala Thr Val Ala Arg Thr Ala lie Leu Asn Asn lie Gly Ala 

10 15 20 

gac ggt get tgg gtg teg ggc gcg gae tet ggc att gte gtt get agt 192 
Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly lie Val Val Ala Ser 
25 30 35 40 
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ccc age acg gat aac ccg gac tac ttc tac acc tgg act cgc gac tct 
Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser 



ggt etc gte etc aag acc etc gtc gat etc ttc ega aat gga gat ace 
Gly Leu Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr 



agt etc etc tec acc att gag aac tac ate tec gcc cag gca att gte 

Ser Leu Leu Ser Thr He Glu Asn Tyr lie Ser Ala Gin Ala He Val 



cag ggt ate agt aac ccc tct ggt gat ctg tec age ggc get ggt etc 
Gin Gly He Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Ala Gly Leu 
90 95 100 

ggt gaa ccc aag ttc aat gtc gat gag act gcc tac act ggt tct tgg 
Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp 
105 110 115 120 

gga egg ccg cag cga gat ggt ccg get ctg aga gca act get atg ate 
Gly Arg Pro Gin Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met He 
125 130 135 

ggc ttc ggg cag tgg ctg ctt gac aat ggc tac acc age acc gca acg 
Gly Phe Gly Gin Trp Leu Leu Asp Asn Gly Tyr Thr Ser Thr Ala Thr 
140 145 150 

gac att gtt tgg ccc etc gtt agg aac gac ctg teg tat gtg get caa 
Asp He Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gin 
155 160 165 

tac tgg aac cag aca gga tat gat etc tgg gaa gaa gtc aat ggc teg 
Tyr Trp Asn Gin Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser 
170 175 180 

tct ttc ttt acg att get gtg caa cac cgc gcc ctt gtc gaa ggt agt 
Ser Phe Phe Thr He Ala Val Gin His Arg Ala Leu Val Glu Gly Ser 

135 190 195 200 

gcc ttc gcg acg gcc gtc ggc teg tec tgc tec tgg tgt gat tct cag 
Ala Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gin 

205 210 215 

gca CGC gaa att etc tgc tac ctg cag tec ttc tgg acc ggc age ttc 
Ala Pro Glu He Leu Cys Tyr Leu Gin Ser Phe Trp Thr Gly Ser Phe 
220 225 • 230 

att ctg gcc aac ttc gat age age cgt tec ggc aag gac gea aac acc 
lie Leu Aia Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Ala Asn Thr 

235 240 245 

etc ctg gga age ate cac ace ttt gat cet gag gee gca tgc gac gae 
Leu Leu Gly Ser He His Thr Phe Asp Pro Glu Ala Ala Cys Asp Asp 
250 255 260 

tec ace ttc cag ccc tgc tec ccg cgc gcg etc gee aac cac aag gag 
Ser Thr Phe Gin Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu 
265 270 275 280 

gtt gta gac tct ttc cgc tea ate tat acc etc aac gat ggt etc agt 
Val Val Asp Ser Phe Arg Ser He Tyr Thr Leu Asn Asp Gly Leu Ser 
285 290 295 

gae age gag get gtt gcg gtg ggt egg tac cet gag gac acg tac tac 
Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Thr Tyr Tyr 
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aac ggc aac ccg tgg ttc ctg tgc acc ttg get gcc gca gag cag ttg 
Asn Giy Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gin Leu 
315 320 325 

tac gat get eta tac cag tgg gac aag cag ggg teg ttg gag gtc aca 
Tyr Asp Ala Leu Tyr Gin Trp Asp Lys Gin Gly Ser Leu Glu Val Thr 
330 335 340 

gat gtg teg ctg gac ttc ttc aag gca ctg tac age gat get get act 
Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Asp Ala Ala Thr 
345 350 355 360 

ggc acc tac tot teg tec agt teg act tat agt age att gta gat gcc 
Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser lie Val Asp Ala 

365 370 375 

gtg aag act ttc gcc gat ggc ttc gtc tct att gtg gaa act cac gcc 
Val Lys Thr Phe Ala Asp Giy Phe Val Ser lie Val Glu Thr His Ala 
380 385 390 

gca age aac ggc tec atg tec gag caa tac gac aag tct gat ggc gag 
Ala Ser Asn Gly Ser Met Ser Glu Gin Tyr Asp Lys Ser Asp Gly Glu 
395 - - 400 405 

cag ctt tec get cgc gac ctg acc tgg tct tat get get ctg ctg acc 
Gin Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr 

410 415 420 

gcc aac aac cgt cgt aac tec gtc gtg cet get tct tgg ggc gag acc 
Ala Asn Asn Arg Arg Asn Ser Val Val Pro Ala Ser Trp Gly Glu Thr 
425 430 435 440 

tct gcc age age gtg ecc ggc acc tgt gcg gcc aca tct gcc att ggt 
Ser Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala He Gly 
445 450 455 

ace tac age agt gtg act gtc ace teg tgg ccg agt ate gtg get act 
Thr Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser He Val Ala Thr 
460 465 470 

ggc ggc ace act aeg acg get acc ccc act gga tec ggc age gtg ace 
Gly Gly Thr Thr Thr Thr Ala Thr Pro Thr Gly Ser Gly Ser Val Thr 
475 480 485 

teg acc age aag ace acc gcg act get age aag acc age acc aeg acc 
Ser Thr Ser Lys Thr Thr Ala Thr Ala Ser Lys Thr S§r Thr Thr Thr 
490 495 500 

cgc tct ggt atg tea ctg tga 
Arg Ser Gly Met Ser Leu 
505 510 



<210> 2 
<211> 534 
<212> PRT 

<213> Aspergillus niger 

<220> 

<221> SIGNAL 
<222> (1) . . . (24) 

<400> 2 

Met Ser Phe Arg Ser Leu Leu Ala Leu Ser Gly Leu Val Cys Thr Gly 
-20 -15 -10 
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Leu Ala Asn Val lie Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu Ser 

-5 15 
Asn Glu Ala Thr Val Ala Arg Thr Ala lie Leu Asn Asn lie Gly Ala 

10 15 20 

Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly He Val Val Ala Ser 
25 30, 35 40 

Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser 

45 50 55 

Gly Leu Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr 

60 • 65 70 

Ser Leu Leu Ser Thr He Glu Asn Tyr He Ser Ala Gin Ala He Val 

75 80 85 

Gin Gly lie Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Ala Gly Leu 

90 95 100 

Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp 
105 110 115 120 

Gly Arg Pro Gin Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met He 

125 130 135 

Gly Phe Gly Gin Trp Leu. Leu Asp Asn Gly Tyr Thr Ser Thr Ala Thr 

140 145 150 

Asp He Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gin 

155 160 165 

Tyr Trp Asn Gin Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser 

170 175 180 

Ser Phe Phe Thr He Ala Val Gin His Arg Ala Leu Val Glu Gly Ser 
185 190 195 200 

Ala Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gin 

205 210 215 

Ala Pro Glu He Leu Cys Tyr Leu Gin Ser Phe Trp Thr Gly Ser Phe 

220 225 230 

He Leu Ala Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Ala Asn Thr 

235 240 245 

Leu Leu Gly Ser He His Thr Phe Asp Pro Glu Ala Ala Cys Asp Asp 

250 255 260 

Ser Thr Phe Gin Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu 
265 270 275 290 

Val Val Asp Ser Phe Arg Ser He Tyr Thr Leu Asn Asp Gly Leu Ser 

285 290 295 

Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Thr Tyr Tyr 

300 305 310 

Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gin Leu 

315 320 325 

Tyr Asp Ala Leu Tyr Gin Trp Asp Lys Gin Gly Ser Leu Glu Val Thr 

330 335 340 

Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Asp Ala Ala Thr 
345 350 355 360 

Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser He Val Asp Ala 

365 370 * 375 

Val Lys Thr Phe Ala Asp Gly Phe Val Ser He Val Glu Thr His Ala 

380 385 390 

Ala Ser Asn Gly Ser Met Ser Glu Gin Tyr Asp Lys Ser Asp Gly Glu 

395 400 405 

Gin Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr 

410 415 420 

Ala Asn Asn Arg Arg Asn Ser Val Val Pro Ala Ser Trp Gly Glu Thr 
425 430 435 440 

Ser Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala He Gly 

445 450 455 

Tfir Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser He Val Ala Thr 

460 465 470 

Gly Gly Thr Thr Thr Thr Ala Thr Pro Thr Gly Ser Gly Ser Val Thr 

475 480 485 

Ser Thr Ser Lys Thr Thr Ala Thr Ala Ser Lys Thr Ser Thr Thr Thr 

490 495 500 

Arg Ser Gly Met Ser Leu 
505 510 
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<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 7258 
<400> 3 

gaatgacttg gttgacgcgt caccagtcac 30 

<210> 4 
<211> 68 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 21401 



<400> 4 

ggggatcatg ataggactag ccatattaat gaagggcata taccacgcct tggacctgcg 60 
ttatagcc 68 

<210> 5 
<211> 25" " 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 107581 



<400> 5 

gcaacgaagc gcccgtggct cgtac 25 

<210> 6 
<211> 88 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer FAMGIL 



<400> 6 

cgaagcgacc gtggctcgta ctgccatcta taacatcggc gcgtctgtgc gcggtggcat 60 
tgtcgttgct agtcccagca cggataac 98 

<210> 7 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer RAMGl 



<400> 7 

gatggcagta cgagccacgg tcgcttcg 28 

<210> 8 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PRIMER FAMGIV 



<400> 8 

gtgtcgctgg acttcttcaa gaacctctta ccctactaca gtcgttatca ttgatgccgt 60 
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gaagactttc gccga 



<210> 9 
<211> 21 
<212> DNA 

<213> PRIMER RAMGVI 



<400> 9 
cttgaagaag tccagcgaca c 

<210> 10 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer FG2 



<400> 10 
catccccagg atccttactc agcaatg 

<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<4C0> 11 

ctcaaacgac tcaccagcct ctagagt 27 

<210> 12 
<211> 2602 
<212> DNA 

<213> ASPERGILLOS NIGER 



<400> 12 

ttcgtcgcct aatgtctcgt ccgttcacaa actgaagagc ttgaagtggc gagatgtctc 60 

tgcaggaatt caagctagat gctaagcgat attgcatggc aatatgtgtt gatgcatgtg 120 

cttcttcctt cagcttcccc tcgtgcgagt gaggtttggc tataaattga agtggttggt 180 

cggggttccg tgaggggctg aagtgcttcc tcccttttag gcgcaactga gagcctgagc 24 0 

ttcatcccca gcatcattac acctcagcaa tgtcgttccg atctctactc gccctgagcg 300 

gcctcgtctg cacagggttg gcaaatgtga tttccaagcg cgcgaccttg gattcatggt 360 

tgagcaacga agcgaccgtg gctcgtactg ccatcctgaa taacatcggg gcggacggtg 420 

cttgggtgtc gggcgcggac tctggcattg tcgttgctag tcccagcacg gataacccgg 480 

actgtatgtt tcgagctcag atttagtatg agtgtgtcat tgattgattg atgctgactg 540 

gcgtgtcgtt tgttgtagac ttctacacct ggactcgcga ctctggtctc gtcctcaaga 600 

ccctcgtcga tctcttccga aatggagata ccagtctcct ctccaccatt gagaactaca 660 

tctccgccca ggcaattgtc cagggtatca gtaacccctc tggtgatctg tccagcggcg 720 

ctggtctcgg tgaacccaag ttcaatgtcg atgagactgc ctacactggt tcttggggac 780 

ggccgcagcg agatggtccg gctctgagag caactgctat gatcggcttc gggcagtggc 840 

tgcttgtatg ttctccaccc ccttgcgtct gatctgtgac atatgtagct gactggtcag 900 

gacaatggct acaccagcac cgcaacggac attgtttggc ccctcgttag gaacgacctg 960 

tcgtatgtgg ctcaatactg gaaccagaca ggatatggtg tgtttgtttt attttaaatt 1020 

tccaaagatg cgccagcaga gctaacccgc gatcgcagat ctctgggaag aagtcaatgg 1080 

ctcgtctttc tttacgattg ctgtgcaaca ccgcgccctt gtcgaaggta gtgccttcgc 1140 

gacggccgtc ggctcgtcct gctcctggtg tgattctcag gcacccgaaa ttctctgcta 1200 

cctgcagtcc ttctggaccg gcagcttcat tctggccaac ttcgatagca gccgttccgg 1260 

caaggacgca aacaccctcc tgggaagcat ccacaccttt gatcctgagg ccgcatgcga 1320 

cgactccacc ttccagccct gctccccgcg cgcgctcgcc aaccacaagg aggttgtaga 1380 

ctctttccgc tcaatctata ccctcaacga tggtctcagt gacagcgagg ctgttgcggt 14 40 

gggtcggtac cctgaggaca cgtactacaa cggcaacccg tggttcctgt gcaccttggc 1500 

tgccgcagag cagttgtacg atgctctata ccagtgggac aagcaggggt cgttggaggt 15 60 

cacagatgtg tcgctggact tcttcaaggc actgtacagc gatgctgcta ctggcaccta 1620 

ctcttcgtcc agttcgactt atagtagcat tgtagatgcc gtgaagactt tcgccgatgg 1680 

cttcgtctct attgtggtaa gtctacgcta gacaagcgct catgttgaca gagggtgcgt 1740 
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actaacagaa gtaggaaact 
agtctgatgg cgagcagctt 
ccgccaacaa ccgtcgtaac 
gcgtgcccgg cacctgtgcg 
cctcgtggcc gagtatcgtg 
ccggcagcgt gacctcgacc 
cgtcatcaac ctcctgtacc 
ccaccaccta cggcgagaac 
aaaccagcga cggcatagct 
atgtcactgt gactctgccg 
gcgatgactc cgtggagtgg 
gcggaacgtc gaccgcgacg 
tagttaaagg atggggatga 
taatagtagt gaaatggaag 
atccggaaat acagacaccg 

<210> 13 
<211> 540 
<212> PRT 
<213> ASPERGILLC 

<400> 13 



Met 


Ser Phe 




Ser 








Ser 


Gly 


Leu 


Val 


Cys 


Thr 


Gly 


1 






5 








10 










15 






Ala Asn 


Val 


ile 


Ser 


Lys Arg 


Ala 


Thr 


Leu 




Ser 


Trp 




Ser 






20 








25 










30 






Asn 


Glu Ala 


Thr 


Val 


Ala 


Arg Thr 


Ala 


lie 


Leu 


Asn 


Asn 


lie 


Gly 


Ala 




35 








40 










45 










Gly Ala 


Trp 


Val 


Ser 


Gly Ala 


Asp 


Ser 


Gly 


lie 


Val 


Val 


Ala 


Ser 




50 








55 








60 










Pro 


Ser Thr 


Asp 


Asn 


Pro 


Asp Tyr 


Phe 


Tyr 


Thr 


Trp 


Thr 


Arg 


Asp 


Ser 


65 








70 








75 










80 


Gly 


Leu Val 


Leu 


Lys 


Thr 


Leu Val 


Asp 


Leu 


Phe 


Arg 


Asn 


Gly Asp 


Thr 








85 








90 










95 




Ser 


Leu Leu 


Ser 


Thr 


lie 


Glu Asn 


Tyr 


He 


Ser 


Ala 


Gin 


Ala 


lie 


Val 






100 








105 










110 






Gin 


Gly He 


Ser 


Asn 


Pro 


Ser Gly 


Asp 


Leu 


Ser 


Ser 


Gly 


Ala 


Gly 


Leu 




115 








120 










125 








Gly Glu Pro 


Lys 


Phe 


Asn 


Val Asp 


Glu 


Thr 


Ala 


Tyr Thr 


Gly 


Ser 


Trp 




130 








135 








140 








Gly 


Arg Pro 


Gin 




Asp 


Gly Pro 


Ala 


Leu 




Ala 


Thr 


Ala 


Met 


He 


145 








150 








155 










160 


Gly 


Phe Gly 


Gin 


Trp 


Leu 


Leu Asp 


Asn 


Gly 


Tyr 


Thr 


Ser 


Thr 


Ala 


Thr 








165 








170 










175 




Asp 


He Val 


Trp 


Pro 


Leu 


Val Arg 


Asn 


Asp 


Leu 


Ser 


Tyr 


Val 


Ala 


Gin 






180 








185 










190 






Tyr 


Trp Asn Gin 


Thr 


Gly Tyr Asp 


Leu Trp Glu Glu Val 


Asn 


Gly 


Ser 




195 








200 










205 








Ser 


Phe Phe 


Thr 


lie 


Ala 


Val Gin 


His 


Arg Ala 


Leu 


Val 


Glu 


Gly 


Ser 




210 








215 








220 










Ala 


Phe Ala 


Thr 


Ala 


Val Gly Ser 


Ser 


Cys 


Ser 


Trp 


Cys 


Asp 


Ser 


Gin 


225 








230 








235 










240 


Ala 


Pro Glu 


He 


Leu 


Cys 


Tyr Leu 


Gin 


Ser 


Phe 


Trp 


Thr 


Gly 


Ser 


Phe 








245 








250 










255 




lie 


Leu Ala 


Asn 


Phe 


Asp 


Ser Ser 


Arg 


Ser 


Gly 


Lys 


Asp 


Ala 


Asn 


Thr 






260 








265 










270 






Leu 


Leu Gly 


Ser 


lie 


His 


Thr Phe 


Asp 


Pro 


Glu 


Ala 


Ala 


Cys 


Asp 


Asp 




275 








280 










285 


Ser 


Thr Phe 


Gin 


Pro 


Cys 


Ser Pro 


Arg Ala 


Leu 


Ala 


Asn 


His 


Lys 


Glu 




290 








295 








300 








Val 


Val Asp 


Ser 


Phe 




Ser Ile 


Tyr Thr 


Leu 


Asn 


Asp 


Gly 




Ser 


305 








310 








315 










320 




Ser Glu 


Ala 


Val 


Ala 


Val Gly 


Arg 


Tyr 


Pro 


Glu 


Asp 


Thr 


Tyr 


Tyr 








325 








330 










335 




Asn Gly Asn Pro 


Trp 


Phe 


Leu Cys 


Thr 


Leu 


Ala 


Ala 


Ala 


Glu 


Gin 


Leu 






340 








345 










350 






Tyr Asp Ala 


Leu 


Tyr 


Gin Trp Asp 


Lys 


Gin Gly Ser Leu 


Glu 


Val 


Thr 



cacgccgcaa gcaacggctc catgtccgag caatacgaca 1800 

tccgctcgcg acctgacctg gtcttatgct gctctgctga 13 60 

tccgtcgtgc ctgcttcttg gggcgagacc tctgccagca 1920 

gccacatctg ccattggtac ctacagcagt gtgactgtca 1980 

gctactggcg gcaccactac gacggctacc cccactggat 2040 

agcaagacca ccgcgactgc tagcaagacc agcaccagta 2100 

actcccaccg ccgtggctgt gactttcgat ctgacagcta 2160 

atctacctgg tcggatcgat ctctcagctg ggtgactggg 2220 

ctgagtgctg acaagtacac ttccagcgac ccgctctggt 2280 

gctggtgagt cgtttgagta caagtttatc cgcattgaga 2340 

gagagtgatc ccaaccgaga atacaccgtt cctcaggcgt 2400 

gtgactgaca cctggcggtg acaatcaatc catttcgcta 24 60 

gggcaattgg ttatatgatc atgtatgtag tgggtgtgca 2520 

ccaagtcatg tgattgtaat cgaccgacgg aattgaggat 2580 

gg 2602 



US NIGER 
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355 






360 










3 65 








Asp 


Val 


Ser 


Leu Asp Phe 


Phe 


Lys 


Ala 


Leu 


Tyr 


Ser 


Asp 


Ala 


Ala 


Thr 




370 






375 










380 










Gly 


Thr 


Tyr 


Ser Ser Ser 


Ser 


Ser 


Thr 


Tyr 


Ser 


Ser 


lie 


Val 


Asp 


Ala 


385 






390 










395 








400 


Val 


Lys 


Thr 


Phe Ala Asp 


Gly 


Phe 


Val 


Ser 


lie 


Val 


Glu 


Thr 


His 


Ala 








405 








410 










415 




Ala 


Ser 


Asn 


Gly Ser Met 


Ser 


Glu 


Gin 


Tyr 


Asp 


Lys 


Ser 


Asp 


Gly Glu 








420 






425 










430 






Gin 


Leu 


Ser 


Ala Arg Asp 


Leu 


Thr 


Trp 


Ser 


Tyr 


Ala 


Ala 




Leu 


Thr 






435 






440 










445 








Ala 


Asn 


Asn 


Arg Arg Asn 


Ser 


Val 


Val 


Pro 


Ala 


Ser 


Trp Gly 


Glu 


Thr 




450 






455 










4 60 










Ser 


Ala 


Ser 


Ser Val Pro 


Gly 


Thr 


Cys 


Ala 


Ala 


Thr 


Ser 


Ala 


He 


Gly 


465 






470 










475 










480 


Thr 


Tyr 


Ser 


Ser Val Thr 


Val 


Thr 


Ser 


Trp 


Pro 


Ser 


lie 


Val 


Ala 


Thr 








485 








490 










495 




Gly 


Gly 


Thr 


Thr Thr Thr 


Ala 


Thr 


Pro 


Thr 


Gly 


Ser 


Gly 


Ser 


Val 


Thr 








500 






505 






510 






Ser 


Thr 


Ser 


Lys Thr Thr 


Ala 


Thr 


Ala 


Ser 


Lys 


Thr 


Ser 


Thr 


Ser 


Thr 






515 






520 








525 








Ser 


Ser 


Thr 


Ser Cys Thr 


Thr 


Pro 


Thr 


Ala 


Val 


Ala 


Val 


Thr 


Phe 


Asp 




530 






535 










540 










Leu 


Thr 


Ala 


Thr Thr Thr 


Tyr 


Gly 


Glu 


Asn 


lie 


Tyr 




Val 


Gly 


Ser 


545 






550 










555 






560 


lie 


Ser 


Gin 


Leu Gly Asp 


Trp 


Glu 


Thr 


Ser 


Asp 


Gly 


lie 


Ala 


Leu 


Ser 


























575 




Ala 


Asp 


Lys 


Tyr Thr Ser 


Ser 


Asp 


Pro 


Leu 


Trp 


Tyr 


Val 


Thr 


Val 


Thr 








580 






585 










590 








Pro 


Ala 


Gly Glu Ser 


Phe 


Glu 


Tyr 




Phe 


lie 




lie 


Glu 


Ser 






595 






600 










605 








Asp 


Asp 


Ser 


Val Glu Trp 


Glu 


Ser 


Asp 


Pro 


Asn 


Arg 


Glu Tyr 


Thr 


Val 




610 






615 










620 










Pro 


Gin 


Ala 


Cys Gly Thr 


Ser 


Thr 


Ala 


Thr 


Val 


Thr 


Asp 


Thr 


Trp 


Arg 


625 






630 










635 








640 



<210> 14 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer K1-T2X 

n at positions 22 and 23 is a or g or c or t 

<400> 14 

atgtgatttc caagcgcgcg vnnttggatt catggttgag caa 

<210> 15 

<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-N9X 

n at positions 22 and 23 is a or g or c or t 

<400> 15 

ccttggattc atggttgagc vnngaagcga ccgtggctcg tac 

<210> 16 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-AllX 
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n at positions 22 and 23 is a or g or c or t 



<400> 16 

attcatggtt gagcaacgaa vnnaccgtgg ctcgtactgc cat 

<210> 17 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-L66X 

n at positions 22 and 23 is a or g or c or t 

<400> 17 

tcctcaagac cctcgtcgat vnnttccgaa atggagatac cag 

<210> 18 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-S386X 

n at 'positions 22 and 23 is a or g or c or t 

<400> 18 

ctttcgccga tggcttcgtc vnnattgtgg aaactcacgc cgc 

<210> 19 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-E389X 

n at positions 22 and 23 is a or g or c or t 

<400> 19 

atggcttcgt ctctattgtg vnnactcacg ccgcaagcaa egg 

<210> 20 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-T390X 

n at positions 22 and 23 are a or g or c 

<400> 20 

gcttcgtctc tattgtggaa vnncacgccg caagcaacgg etc 

<210> 21 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-A393X 

n at positions 22 and 23 is a or g or c or t 

<400> 21 

ctattgtgga aactcacgcc vnnagcaacg gctccatgtc cga 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer Hkl-S394X 

n at positions 22 and 23 is a or g or c or t 



<400> 22 

ttgtggaaac tcacgccgca vnnaacggct ccatgtccga gca 43 

<210> 23 
<2I1> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-395X 

n at positions 22 and 23 is a or g or c or t 

<400> 23 

tggaaactca cgccgcaagc vnnggctcca tgtccgagca ata 43 

<210> 24 
<211> 43" • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer Hkl-G396X 

n at positions 22 and 23 is a or g or c or t 

<400> 24 

aaactcacgc cgcaagcaac vnntccatgt ccgagcaata cga 43 

<210> 25 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-K404X 

n at positions 22 and 23 is a or g or c or t 

<400> 25 

ccatgtccga gcaatacgac vnntctgatg gcgagcagct ttc 43 

<210> 26 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-D406X 

n at positions 22 and 23 is a or g or c or t 

<400> 26 

ccgagcaata cgacaagtct vnnggcgagc agctttccgc teg 43 

<210> 27 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-E408X 

n at positions 22 and 23 is a or g or c or t 
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<400> 27 

aatacgacaa gtctgatggc vnncagcttt ccgctcgcga cct 



<210> 28 
<211> 42 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hkl-L410X 

n at positions 22 and 23 is a or g or c or t 

<400> 23 

acaagtctga tggcgagcag vnntccgctc gcgacctgac ct 42 

<210> 29 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-L423X 

n at positions 22 and 23 is a or g or c or t 

<400> 29 

cctggtctta tgctgctctg vnnaccgcca acaaccgtcg taa 43 

<210> 30 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-N42 6X 

n at positions 22 and 23 is a or g or c or t 

<400> 30 

atgctgctct gctgaccgcc vnnaaccgtc gtaactccgt cgtg 44 

<210> 31 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-N427X 

n at positions 22 and 23 is a or g or*c or t 

<400> 31 

ctgctctgct gaccgccaac vnncgtcgta actccgtcgt gcct 44 

<210> 32 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hkl-Y402X 

n at positions 21 and 22 is a or g or c or t 

<400> 32 

acggctccat gtccgagcaa nncgacaagt ctgatggcga gcagct 4 6 

<210> 33 
<211> 41 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> primer Hk2-L234X-sense 

n at positions 20 and 21 is a or g or c or t 

<400> 33 

ctggaccggc agcttcattn nkgccaactt cgatagcagc c 41 

<210> 34 
<211> 42 
<2I2> DNA 

<2I3> Artificial Sequence 
<220> 

<223> primer Hk2-A235S-antisense 



<400> 34 

gaacggctgc tatcgaagtt agacagaatg aagctgccgg tc 42 

<210> 35 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-NF237X-sense 

n at position 20 is a or g or c or t 

<400> 35 

cagcttcatt ctggccaacn atgatagcag ccgttccggc a 41 

<210> 36 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-D235T-antis6nse 
<400> 36 

ccttgccgga acggctgcta gtgaagttgg ccagaatgaa gc 42 

<210> 37 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-D238S-antisen3e 
<400> 37 

ccttgccgga acggctgcta gagaagttgg ccagaatgaa gc 42 

<210> 33 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-S239X-sense 

n at positions 21 and 22 is a or g or c or t 

<400> 38 

tcattctggc caacttcgat nncagccgtt ccggcaagga eg 4 2 
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<210> 39 
<21l> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-S240G-anti3ense 



<400> 39 

ttgcgtcctt gccggaacga ccgctatcga agttggccag aa 42 

<210> 40 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-S242X-antisense 

n at position 22 is a or g or c or t 

<400> 40 

gggtgtttgc gtccttgcca knacggctgc tatcgaagtt g 41 

<210> 41' ■ 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-G243X-anti3ense 

n at position 22 is a or g or c or t 

<400> 41 

ggagggtgtt tgcgtcctta knggaacggc tgctatcgaa g 41 

<210> 42 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-K24 4R-sense 
<400> 42 

cgatagcagc cgttccggca gagacgcaaa caccctcctg g 41 

<210> 43 
<2H> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-T310V-antisen3e 
<400> 43 

acgggttgcc gttgtagtaa acgtcctcag ggtaccgacc c 41 

<210> 44 
<211> 41 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk2-T310S-antisense 
<400> 44 
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acgggttgcc gttgtagtaa gagtcctcag ggtaccgacc c 

<210> 45 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-Y311N-sense 

<400> 45 

tcggtaccct gaggacacga attacaacgg caacccgtgg t 

<210> 46 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hk2-Y312Q-antisense 
<400> 46 

ggaaccacgg gttgccgttt tggtacgtgt cctcagggta c 

<210> 47- - 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-Y312N-antisense 

<400> 47 

ggaaccacgg gttgccgtta ttgtacgtgt cctcagggta c 

<210> 48 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-N313T-sense 

<400> 48 

ccctgaggac acgtactaca ctggcaaccc gtggttcctg t 

<210> 49 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-N313S-sense 

<400> 49 

ccctgaggac acgtactact ctggcaaccc gtggttcctg t 

<210> 50 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-N313G-sense 
<400> 50 

ccctgaggac acgtactacg gtggcaaccc gtggttcctg t 
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<210> 51 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> primer Hk2-N315Q-antisense 
<400> 51 

aggtgcacag gaaccacggt tggccgttgt agtacgtgtc c 41 

<210> 52 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk2-N315E-antisense 

<400> 52 

aggtgcacag gaaccacggt tcgccgttgt agtacgtgtc c 41 

<210> 53 
<211> 41- - 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk2-N315R-antisense 

<400> 53 

aggtgcacag gaaccacggt ctgccgttgt agtacgtgtc c 41 

<210> 54 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk2-F318Y-antisense 

<400> 54 

cggcagccaa ggtgcacaga taccacgggt tgccgttgta g 41 

<210> 55 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk2-Q40 9P-sense 

<400> 55 

cgacaagtct gatggcgagc cactttccgc tcgcgacctg a 41 

<210> 56 
<211> 41 
<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> primer Hk3-D336X-sense 

n at positions 20 and 21 is a or g or c or t 

<400> 56 

cgatgctcta taccagtggn nkaagcaggg gtcgttggag g 41 
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<210> 57 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer H]c3-K337X-s6nse 

n at positions 20 and 21 is a or g or c or t 

<400> 57 

tgctctatac cagtgggacn nkcaggggtc gttggaggtc a 41 

<210> 58 
<2H> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-Q338X-antisense 

n at positions 21 and 22 is a or g or c or t 

<400> 58 

ctgtgacctc caacgacccg nncttgtccc actggtatag a 41 

<210> 59 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-G339X-sense 

n at position 20 is a or g or c or t 

<4C0> 59 

ataccagtgg gacaagcagn cutcgttgga ggtcacagat g 41 

<210> 60 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S340X'-antisense 

n at position 21 is a or g or c or t 

<400> 60 

acacatctgt gacctccaaa ntcccctgct tgtcccactg g * 41 

<210> 61 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S340X"-antisense 

n at position 21 is a or g or c or t 

<400> 61 

acacatctgt gacctccaaa nccccctgct tgtcccactg g 41 

<210> 62 

<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer Hk3-L341X-sense 

n at position 20 is a or g or c or 

<400> 62 

gtgggacaag caggggtcgn uugaggtcac agatgtgtcg c 

<210> 63 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-K352Q-sense 
<400> 63 

tgtgtcgctg gacttcttcc aagcactgta cagcgatgct g 

<210> 64 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-K352R-sense 

<400> 64 

tgtgtcgctg gacttcttca gagcactgta cagcgatgct g 

<210> 65 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-A352D-antisense 

<400> 65 

tagcagcatc gctgtacaga tccttgaaga agtccagcga c 

<210> 66 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer HK3-A353S-antisense 
<400> 66 

tagcagcatc gctgtacaga gacttgaaga agtccagcga c 

<210> 67 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S356P-sense 

<400> 67 

acttcttcaa ggcactgtac ccagatgctg ctactggcac ct 

<210> 68 
<211> 43 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> primer Hk3-S356N-sense 

<400> 68 

acttcttcaa ggcactgtac aaugatgctg ctactggcac eta 

<210> 69 

<211> 43 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk3-S356D-sense 

<400> 69 

acttcttcaa ggcactgtac gaugatgctg ctactggcac eta 

<210> 70 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-D357S-antisense 
<400> 70- ■ 

gagtaggtgc cagtagcagc agagctgtac agtgccttga aga 

<210> 71 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk3-A359S-sense 

<400> 71 

ggcactgtac agcgatgctt ctactggcac ctactcttcg t 

<210> 72 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-T360V-antisense 

<400> 72 

tggacgaaga gtaggtgcca acagcagcat cgctgtacag t 

<210> 73 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-G361X-sense 

n at position 21 is a or g or c or t 

<400> 73 

tgtacagcga tgctgctact nctacctact cttcgtccag ttc 

<210> 74 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer Hk3-T362_R-antisense 



<400> 74 

gtcgaactgg acgaagagta tctgccagta gcagcatcgc tg 42 

<210> 75 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S364X-sense 

n at positions 20 and 21 is a or g or c or t 

<400> 75 

tgctgctact ggcacctacn nktcgtccag ttcgacttat ag 42 

<210> 76 
<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S365X-sense 

n at positions 20 and 21 is a or g or c or t 



<4G0> 76 

tgctactggc acctactctn nktccagttc gacttatagt ag 42 

<210> 77 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S366T-antisense 
<400> 77 

atgctactat aagtcgaact agtcgaagag taggtgccag ta 42 

<210> 78 
<211> 42 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer Hk3-S368X-antisense 

n at position 23 is a or g or c or t 

<400> 78 

tctacaatgc tactataagt agnactggac gaagagtagg tg 42 

<210> 79 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-T369X-sense 

n at positions 20 and 21 is a or g or c or t 

<400> 79 

ctactcttcg tccagttcgn nktatagtag cattgtagat gcc 43 

<210> 80 
<211> 43 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S371X-antisen3e 

n at position 23 is a or g or c or t 

<400> 80 

ttcacggcat ctacaatgct atnataagtc gaactggacg aag 43 

<210> 81 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer Hk3-S372X-sense 

n at positions 21 and 22 is a or g or c or t 

<400> 81 

cgtccagttc gacttatagt nntattgtag atgccgtgaa gac 43 
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